Reinforcement learning - PDFSEARCH.IO - Document Search Engine

Reinforcement learning
Results: 1147

#	Item
281	Functional Assessment and Intervention Design © SG Friedman, 1. Observe and operationally define the target behavior. - Source URL: www.behaviorworks.org - Language: English - Date: 2010-06-04 12:18:30 - Tags: Behaviorism Reinforcement Learning Applied behavior analysis Positive behavior support
282	Reinforcement Learning Techniques in ’Jumper’ MITCHELL BRUNTON and SHAANAN N. COHNEY University of Melbourne General Terms: Machine Learning, Jumper - Source URL: cohney.info - Language: English - Date: 2016-01-24 20:49:37 - Tags: Game artificial intelligence Search algorithms Minimax Alphabeta pruning Principal variation search Artificial neural network Variation Reinforcement learning Pruning Game tree TD-Gammon Tree traversal
283	Deterministic MDPs with Adversarial Rewards and Bandit Feedback Raman Arora TTIC 6045 S. Kenwood Ave. Chicago, IL 60637, USA - Source URL: dept.stat.lsa.umich.edu - Language: English - Date: 2012-09-12 18:50:24 - Tags: Markov models Markov processes Stochastic optimization Mathematical optimization Operations research Reinforcement learning Markov decision process Algorithm Multi-armed bandit Dynamic programming Shortest path problem PP
284	Rollout Allocation Strategies for Classification-based Policy Iteration Victor Gabillon Alessandro Lazaric - Source URL: victorgabillon.nfshost.com - Language: English - Date: 2010-07-01 09:47:14 - Tags: Mathematics Mathematical analysis Artificial intelligence Backgammon Rollout Markov decision process Multi-armed bandit Reinforcement learning Inverted pendulum Pendulum Prime-counting function Valuation
285	Deep Reinforcement Learning for Flappy Bird Kevin Chen Stanford University Abstract Reinforcement learning is essential for training an agent to - Source URL: cs229.stanford.edu - Language: English - Date: 2015-12-14 19:46:07
286	DAGGER and Friends References: 1. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning, Ross, Gordon & Bagnell (DAGGER algorithm) 2. Reinforcement and Imitation Learning via Inte - Source URL: rll.berkeley.edu - Language: English - Date: 2015-10-15 00:39:05
287	Fear the REAPER: A System for Automatic Multi-Document Summarization with Reinforcement Learning Cody Rioux Sadid A. Hasan Yllias Chali University of Lethbridge Philips Research North America University of Lethbridge - Source URL: emnlp2014.org - Language: English - Date: 2014-10-16 05:19:55
288	Policy Communication for Coordination with Unknown Teammates Trevor Sarratt and Arnav Jhala University of California Santa Cruz {tsarratt, jhala}@soe.ucsc.edu Abstract - Source URL: mipc.inf.ed.ac.uk - Language: English - Date: 2015-12-23 07:01:08 - Tags: Multi-agent systems Artificial intelligence Academia Systems science Simulation Belief revision Agent-based model Autonomous Agents and Multi-Agent Systems Reinforcement learning
289	An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning arXiv:1503.04269v1 [cs.LG] 14 Mar Richard S. Sutton - Source URL: arxiv.org - Language: English - Date: 2015-03-16 20:16:49 - Tags: Algebra Linear algebra Mathematics Markov models Markov processes Matrix theory Matrices Q-learning Markov chain Matrix Reinforcement learning Temporal difference learning
290	Scaling up Inverse Reinforcement Learning through Instructed Feature Construction Tomas Singliar Dragos D. Margineantu Boeing Research & Technology P.O. Box 3707, M/C 7L-44 - Source URL: snowbird.djvuzone.org - Language: English - Date: 2011-02-10 16:50:25
